北京邮电大学学报

  • EI核心期刊

北京邮电大学学报 ›› 2006, Vol. 29 ›› Issue (s2): 188-191.doi: 10.13190/jbupt.2006s2.188.312

• 论文 • 上一篇    下一篇

语音识别系统中上下文相关声学模型建模优化

彭 荻1, 刘 刚1, 郭 军1   

  1. 北京邮电大学 信息工程学院, 北京 100876
  • 收稿日期:2006-10-18 修回日期:1900-01-01 出版日期:2006-11-30 发布日期:2006-11-30
  • 通讯作者: 彭 荻

Refining Context-Dependent Tonal Acoustic Modeling in Mandarin LVCSR

PENG Di1, LIU Gang1, GUO Jun1   

  1. School of Information Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Received:2006-10-18 Revised:1900-01-01 Online:2006-11-30 Published:2006-11-30
  • Contact: PENG Di

摘要:

在实验中发现,某些带调三音子的训练数据稀疏会引起识别错误率的上升,为了在一定程度上减少这种影响,提出了使用其无调三音子的模型参数对有调三音子进行初始化。此外还调整了决策树状态捆绑的停止门限,并且采用了混合度分量的自适应增长训练。在863语音库上的实验结果表明,所有这些获得了一定的音子识别性能提高,同时也一定程度上压缩了声学模型大小。

关键词: 声学模型, 语音识别, 三音子

Abstract:

In order to minimize the recognition errors caused by inaccurate model estimations from those toned triphones with limited training samples, we proposed to initialize toned triphones using their own toneless triphone model parameters. Besides, works concerning stopping criteria of decision tree state tying as well as mixture component adaptation are also explored to obtain better performance as well as reduce model scale. Experiments results have shown that, on the 863 corpus, along with all this improvements our system achieves certain increase of phone recognition rate, with much more trainable model scale as well.

Key words: acoustic model, speech recognition, triphone, tone

中图分类号: